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Abstract 

We study restricted computation models related to the tree evaluation problem. The TEP 
was introduced in earlier work as a simple candidate for the {very) long term goal of separating 
L and LogDCFL. The input to the problem is a rooted, balanced binary tree of height h, 
whose internal nodes are labeled with binary functions on [A;] = {1, . . . , k} (each given simply 
as a list of k 2 elements of [k]), and whose leaves are labeled with elements of [A;]. Each 
node obtains a value in [k] equal to its binary function applied to the values of its children. 
The output is the value of the root. The first restricted computation model, called fractional 
pebbling, is a generalization of the black/white pebbling game on graphs, and arises in a natural 
way from the search for good upper bounds on the size of nondeterministic branching programs 
solving the TEP - for any fixed h, if the binary tree of height h has fractional pebbling cost at 
most p, then there are nondeterministic branching programs of size 0(kP) solving the height 
h TEP. We prove a lower bound on the fractional pebbling cost of d-ary trees that is tight to 
within an additive constant for each fixed d. The second restricted computation model we study 
is a semantic restriction on (non)deterministic branching programs solving the TEP - thrifty 
branching programs. Deterministic (resp. nondeterministic) thrifty BPs suffice to implement 
the best known algorithms, based on black pebbling (resp. fractional pebbling), for the TEP. 
In earlier work, for each fixed h a lower bound on the size of thrifty deterministic branching 
programs was proved that is tight for sufficiently large k. We give an alternative proof that 
achieves the same bound for all k and h. We show the same bound still holds in a less- 
restricted model, and also that gradually weaker lower bounds can be obtained for gradually 
weaker restrictions on the model. 



1 Introduction 

The motivations for this paper are those of [|BCM + 09al . and the goals are to extend and improve on 
the results given there (with the exception of Theorem |5l which appeared there verbatim). But from 
a wider view, what we want is to improve our understanding of L in the hope that this will help 
in eventually separating it from (apparently) larger classes. We study the tree evaluation problem 
(TEP), which was defined in llBCM + 09bl and shown to be in LogDCFL. 
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The function version of the Tree Evaluation problem FT h {k) is defined as follows. Let T h be 
the balanced binary tree of height h (see Fig. [T]). For each internal node i of T h the input includes 
a function / 4 : [k] x [A;] — )■ [&] specified as k 2 integers in [k] = {1, . . . , k}. For each leaf the input 
includes an integer in [A;]. We can then say that each internal tree node takes a value in [k] by 
applying its function to the values of its children. The function problem FT h {k) is to compute the 
value of the root, and the decision version BT h {k) is to determine whether this value is 1. 

Since BT h {k) 6 LogDCFL, it is not hard to show that for any unbounded function r{h), 
a lower bound of VL{k r ^) on the number of states for deterministic (resp. non-deterministic) 
branching programs solving FT h [k) or BT h [k) would separate LogDCFL and L (resp. NL)|J. 
To see this, note that inputs to BT h {k) can be encoded with (2 /l_1 — 1)A; 2 log k+2 h ~ 1 log k+0(l) = 
0(2 h k 2 log k) bits, so it suffices to consider polynomial bounding function that are the product of 
a polynomial in 2 h and a polynomial in k, which k r ^ is not. 

In HBCM+0 9bl. the TEP was defined more-generally on balanced d-ary trees, where the func- 
tions attached to internal nodes are of type [k] d — > [k). The motivation was that tight lower bounds 
for height 3 and all d can be proved [|BCM + 09bl , and proving the conjectured lower bound of 
Vt(k 7 / log k) states (with h = 4 and d = 3 fixed, so that the input size n(k) is 0(k 3 log k) bits or 
0(k 3 ) [k] -valued variables) for unrestricted deterministic BPs would beat the best known lower 
bound of VL{n 2 / (log n) 2 ) states for a problem in NP, achieved using Neciporuk's method HNec66l . 
Since we are focusing on restricted computation models here, there is little to gain in including the 
parameter d. That being said, the fractional pebbling lower bound proved in Section 14.11 is given 
for arbitrary d. 



2 Preliminaries 

We write [k] for {1, 2, . . . , k}. For h > 1 we use T h to denote the balanced binary tree of height h. 

Warning: Here the height of a tree is the number of levels in the tree, as opposed to the distance 
from root to leaf. Thus T| has just 3 nodes. 

We number the nodes of T h as suggested by the heap data structure. Thus the root is node 1, and 
in general the children of node i are nodes 2i, 2i + 1 (see Figure [T]). 

Definition 1 (Tree evaluation problems). 

An input / for either the function or decision version of the problem includes: for each 
internal node i of T h , a function // : [A;]x [A;] — > [k] represented as k 2 integers in [k], 
and for each leaf node i, an integer l\ e [k] . 

Function evaluation problem FT h {k): On input /, compute the value v[ G [k] of the 
root 1 of T h , where in general v\ — l\ if i is a leaf and v{ = //(u^*, fli+i) if * is an 
internal node. 

Boolean evaluation problem BT h (k): Accept / iff v\ — 1. 

'Of course, doing so would actually yield the stronger result: Nonuniform L % LogDCFL (resp. Nonuniform 
NL % LogDCFL). 
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Figure 1 : The height 3 binary tree T 3 with nodes numbered heap style. 
2.1 Branching programs 

We use the same branching program model as in [BC M+09al and IB CM+09bl . 



Definition 2 (Branching programs). A nondeterministic k-way branching program B computing 
a total function g : [k] m — > R, where R is a finite set, is a directed rooted multi-graph whose nodes 
are called states. Every edge has a label from [k]. Every state has a label from [m], except \R\ 
output sink states consecutively labeled with the elements from R. An input (x±, . . . , x m ) £ [k] m 
activates, for each 1 < j < m, every edge labeled Xj out of every state labeled j. A computation 
path on input x = (x±, . . . ,x m ) £ [k] m is a directed path consisting of edges activated by x 
which begins with the unique start state and either ends in the final state labeled g(xi, . . . , x m ) or 
is infinite. At least one such computation must end. The size of B is its number of states. B is 
deterministic k-way if every non-output state has precisely k outedges labeled 1, . . . , k. 

We say that B solves a decision problem (relation) if it computes the characteristic function of 
the relation. 

A fc-way branching program computing FT h (k) or BT h (k) requires k 2 fc-valued arguments for 
each internal node i of T h in order to specify the function fi, together with one A;-valued argument 
for each leaf. Thus in the notation of the above definition, FT h {k) : [k] m — >■ R where R = [k] and 
m = (2 11 - 1 - l)k 2 + 2 h ~ 1 . Also BT h (k) : [k] m {0, 1}. 

Important: Since we only study the tree evaluation problem (TEP) here, we give the input 
variables mnemonic names: fi(a,b) is an input variable (called an internal node variable) for 
every internal node i and a, b £ [k] and U is an input variable (called a leaf variable) for every leaf 
i. 

For fixed h we are interested in how the number of states required for a A;-way branching 
program to compute FT h {k) and BT h {k) grows with k. This is why we write h in the superscript 
of FT h (k) and BT h {k). We define #detFstates ft (fc) (resp. #ndetFstates /l (fc)) to be the mininum 
number of states required for a deterministic (resp. nondeterministic) A>way branching program to 
solve FT h (k). Similarly we define ^detBstates'^/c) and #ndetBstates ?t (/c) to be the number of 
states required to BT h {k). 

Thrifty programs are a restricted form of /c-way branching programs for solving tree evaluation 
problems, introduced in [|BCM + 09all . Thrifty programs efficiently simulate pebbling algorithms, 
and implement the best known upper bounds for #ndetBstates ft (£;) and #detFstates h (fc), and are 
within a factor of log k of the best known for #detBstates /l (fc). 
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Definition 3 (Thrifty branching program). A deterministic A;-way branching program which solves 
FT h {k) or BT h (k) is thrifty if during the computation on any input every query fi(a,b) to an 
internal node i of T h satisfies the condition that (a, b) is the tuple of correct values for the children 
of node i (i.e. v\ { = a and f^+i = b). A non-deterministic such program is thrifty if for every 
input every computation which ends in a final state satisfies the above restriction on queries. 

This is a strong restriction. For example, a deterministic thrifty BP cannot, for any internal 
node i, iterate over all the k 2 variables that define f, or even just two distinct variables. 



In ||BCM + 09al the following theorem is given, showing how upper bounds for black pebbling 



and fractional pebbling yield upper bounds for deterministic and nondeterministic branching pro- 
grams solving the TEP. The proof can be found in HBCM+09dl . 

Theorem ( llBCM + 09all ): 

(i) If T h can be black pebbled with p pebbles, then deterministic thrifty branching programs 
with 0(k p ) states can solve FT h {k) and BT h (k). 

(ii) If T h can be fractionally pebbled with p pebbles then non-deterministic thrifty branching 
programs can solve BT h {k) with 0(k p ) states. 

Also in [BCM + 09a|, the following lower bound was given for deterministic thrifty programs. 
The proof can be found in llBCM + 09cll . 

Theorem (IBC M + 09all ): For all h, for k > ( h 2 _J every deterministic thrifty branching program 
solving BT h {k) requires at least ~k h states. 

Theorem |4] in Section |3l which is a special case of Theorem [6] in Section I4.2L gives a small 
improvement on that result. The main improvement is that it gives a tight bound that holds for all 
pairs k and h, rather than requiring that k be much larger than h. The constant 1/2 also goes away: 
Theorem |4]: For all h, k every deterministic thrifty branching program solving BT h (k) requires 

at least k h states. 



2.2 Pebbling 

The pebbling game for dags was defined by Paterson and Hewitt [PH70J and was used as an ab- 
straction for deterministic Turing machine space in HCoo74ll . Black- white pebbling was introduced 
in HCS76B as an abstraction of non-deterministic Turing machine space (see HNor09il for a recent 
survey). Fractional pebbling was introduced in ||BCM + 09a l. 

Let us first define three versions of the pebbling game. We will not be proving anything about 
black- white pebbling directly, but fractional pebbling is a generalization of black-white pebbling, 
so it will be easier to define it first. The first is a simple 'black pebbling' game: A black pebble can 
be placed on any leaf node, and in general if all children of a node % have pebbles, then one of the 
pebbles on the children can be slid to i (this is a "black sliding move')'. Any black pebble can be 
removed at any time. The goal is to pebble the root, using as few pebbles as possible. The second 
version is 'whole' black-white pebbling as defined in [CS76J with the restriction that we do not 
allow "white sliding moves". Thus if node i has a white pebble and each child of i has a pebble 
(either black or white) then the white pebble can be removed. (A white sliding move would apply 
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Figure 2: An optimal fractional pebbling sequence for the height 3 tree using 2.5 pebbles, all 
configurations included. The grey half circle means the white value of that node is .5, whereas 
unshaded area means absence of pebble value. So for example in the seventh configuration, node 
2 has black value .5 and white value .5, node 3 has black value 1, and the remaining nodes all have 
black and white value 0. 

if one of the children had no pebble, and the white pebble on i was slid to the empty child. We do 
not allow this.) A white pebble can be placed on any node at any time. The goal is to start and end 
with no pebbles, but to have a black pebble on the root at some time. 

The third is fractional pebbling, which generalises whole black-white pebbling by allowing 
each node i to have a black value b(i) and a white value w(i) such that b(i) + w(i) < 1. The total 
pebble value (i.e. b(i) + w(i)) of each child of a node % must be 1 before the black value of i is 
increased or the white value of % is decreased. Figure |2] shows the sequence of configurations for 
an optimal fractional pebbling of the binary tree of height three using 2.5 pebbles. 

Our motivation for choosing these definitions is that we want pebbling algorithms for trees to 
closely correspond to k-way branching program algorithms for the tree evaluation problem. If, as 
in the survey by Razborov HRaz91L we instead used switching and rectifier networks instead of 
nondeterministic branching programs, where input variable labels are on the edges rather than the 
nodes, and a node can have any number of out-edges, and the size of the program is defined as the 
number of edges, then we would get better upper bounds by using a variant of fractional pebbling 
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where the following analogue of "white sliding moves" are allowed: Suppose you want to remove 
white value from an internal node i by first increasing the white value of one or both of the children 
of i. With white sliding moves, you can combine those two moves. A precise definition is given in 
[BC M + 09cll , where it is also shown that the height 4 binary tree can be fractionally pebbled using 



white sliding moves with 8/3 pebbles, from which it follows that there are switching and rectifier 



networks with 0(k s/3 ) edges that solve BT 4 (k). In contrast, it is shown in [|BCM + 09cl that 3 



pebbles are necessary and sufficient using our chosen definition of fractional pebbling. 

Now we give the formal definition of fractional pebbling, and then define the other two notions 
as restrictions on fractional pebbling. 

Definition 4 (Pebbling). A fractional pebble configuration on a rooted d-ary tree T is an assign- 
ment of a pair of real numbers (b(i),w(i)) to each node % of the tree, where 

0<b(i),w(i) (1) 
b(i) + w(i) < 1 (2) 

Here b(i) and w(i) are the black pebble value and the white pebble value, respectively, of i, and 
b(i) + w(i) is the pebble value of i. The number of pebbles in the configuration is the sum over 
all nodes i of the pebble value of i. The legal pebble moves are as follows (always subject to 
maintaining the constraints (OQ), ©): (i) For any node i, decrease b(i) arbitrarily, (ii) For any node 
i, increase w(i) arbitrarily, (iii) For every node i, if each child of i has pebble value 1, then decrease 
w(i) arbitrarily, increase b(i) arbitrarily, and simultaneously decrease the black pebble values of 
the children of i arbitrarily. □ 

A fractional pebbling of T using p pebbles is any sequence of (fractional) pebbling moves on 
nodes of T which starts and ends with every node having pebble value 0, and at some point the 
root has black pebble value 1, and no configuration has more than p pebbles. 

A whole black-white pebbling of T is a fractional pebbling of T such that b(i) and w(i) take val- 
ues in {0, 1} for every node i and every configuration. A black pebbling is a black- white pebbling 
in which w(i) is always 0. 

Notice that rule (iii) does not quite treat black and white pebbles dually, since the pebble val- 
ues of the children must each be 1 before any decrease of w(i) is allowed. A true dual move 
would allow increasing the white pebble values of the children so they all have pebble value 1 
while simultaneously decreasing w(i). In other words, we allow black sliding moves, but disallow 
white sliding moves. The reason for this (as mentioned above) is that non-deterministic branching 
programs can simulate the former, but not the latter. 

We use #Bpebbles(T), #BWpebbles(T), and #FRpebbles(T) respectively to denote the min- 
imum number of pebbles required to black pebble T, black-white pebble T, and fractional peb- 
ble T. Bounds for these values are given in llBCM + 09 all. For examp le, #Bpe bbles(T fc ) = h, 
#BWpebbles(T fc ) = \h/2] + 1, and #FRpebbles(T fc ) < h/2 + 1 (see HBCM+09cll for proofs). In 
particular #FRpebbles(T 3 ) = 2.5 (see Figure [2]). 



2 It is easy to show that we can require, without increasing the pebbling cost, that every type (ii) move to increase 
w(i) so that b(i) + w(i) = 1, and a type (iii) move to decrease w(i) to 0, but we will not need to use that fact here. 
3 And also for arbitrary degree d 
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3 Thrifty Branching Programs and Pebbling 



3.1 Upper Bound for Thrifty BPs 

It is easy to show that the determinstic thrifty BPs we get from pebbling have 0(k h ) states, for all 
h. The next theorem shows there is a simple expression for the exact number of states. We do not 
know how to beat this upper bound for any k and h, even by one. 

Theorem 1. There are (k + l) h state deterministic thrifty BPs solving FT h {k). 

Proof. For h = 1 you have the start state that queries the single input variable l x , with an edge out 
to each of the k output states. 

For h > 2, we start with k + 1 copies B , B x ..., of the BP that computes FT h ~ l {k). Here is 
the idea. We will use B to compute the value of the left subtree, and for each a G [k] we use B a 
to compute the value of the right subtree while remembering the value of the left subtree. At the 
level just before the output states, for each (a, b) G [k] 2 there is a state that queries fi(a, b). 

Now for the formal definition. We will combine B , B x , . . . , B^ in such a way that B x , Bk 
are pairwise disjoint, and for all a G [k], B and B a intersect in exactly one state; namely, for all 
a G [k], if g a is the output state of B labeled a, and q a is the start state of B a , then we remove 
g ,a an d for each of the now-dangling B -edges e, we connect the free end of e to q a . 

Now change the state labels of B so that whenever it queries fi(b x ,b 2 ) (resp. U) for some 
i G T h ~ x , it instead queries f a2 (i)(bi, b 2 ) (resp. l a2 a)) where a 2 maps node labels of T h ~ x to node 
labels of the subtree of T h rooted at node 2, in the obvious way. Similarly, for each a in [k], change 
the state labels of B a so that whenever it queries fi(b x , b 2 ) (resp. U) for some i G T h ~ x , it instead 
queries the variable f a3 (i)(b x , b 2 ) (resp. l az {i)) where a 3 is like a 2 except it maps node labels of 
T h ~ l to node labels of the subtree of T h rooted at node 3. 

Next, for each a, b in [k], change the b labeled output state of B a into a state that queries fi (a, b). 
Finally, add in the obvious way (there is only one way) k new output states that receive edges from 
the k 2 former output states of B x , B k . That completes the definition of the BP the computes 
FT h (k). Its size s(h, k) is given by 

s(h, k) = (k + 1) s(h - 1, ifc) - k + k = (k + l) h 

Where the —k is for the states q x , . . . , that get counted twice in the expression (k + 1) s(h — 1, k) 
and the +k is for the new output states. □ 

3.2 Upper Bound is Exact for Height 2 

We can show the previous upper bound (k + l) 2 is the exact state cost FT 2 {k) (note k + 1 is 
obviously exact for FT 1 (k)). In Section[5]we conjecture that (k + l) 3 is exact for height 3 as well. 

Theorem 2. Every BP solving FT 2 (k) has at least (k + l) 2 states. 

Proof. There are at least k 2 states that query the root, since for all a, b there is at least one state 
that queries /i(a, b). There are k output states. 
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Let E* be the inputs such that fx is + mod k. Let Q* be the states q such that q is the last 
leaf querying state on the computation path of some I E E*. We can show Q* has size at least k. 
Let g be the function that maps each input in E* to its last leaf querying state. Since \E*\ = k 2 , 
it suffices to show that < k for every q in Q*. Let I a ,b be the unique input in E* with 

(I2, Z|) = (a, b). Let g G Q* be arbitrary. Consider the case that q queries l 3 - the other case is 
similar. Then it suffices to show that for every b, there is at most one a such that I a b is in g~ 1 {q). 
Just observe that if two inputs in E* reach q then they have the same output state, and the label a' 
of the output state determines the unique a such that a' = a + b mod k. 

Now we want to show there is at least one state that queries a leaf and is not in Q*. Since all 
the inputs in E* agree on the variables, there is a unique state q that is the first leaf querying 
state visited by any of them. Because + mod k is a quasigroup, every input in E* must query l 2 
and l 3 each at least once. So for every I E E* there is a leaf querying state on the computation 
path of / after q that queries a leaf variable. Hence q ^ Q*. That is k 2 + k + k + 1 = (k + l) 2 
states total. □ 



3.3 Minimum-depth BPs are Thrifty 

Let the depth of a deterministic branching program be the maximum number of states visited by 
any input, with the output state included. The thrifty programs we get from pebbling have depth 
2 h , and it is easy to show that depth 2 h is required, regardless of size; just note that Lemma[T|holds 
without the depth restriction. In fact, we can show thrifty programs are the only fastest determinstic 
BPs solving BT h (k). 

Theorem 3. For all h, k every deterministic branching program of depth at most 2 h computing 
BT h (k) (or FT h (k)) is thrifty. 

Proof. Let E be the inputs all of whose internal node functions are quasigroups, and E\ the inputs 
that query each node exactly once. 

Lemma 1. Every input in Eq queries each of its thrifty variables. 

Proof. Suppose I E E does not query its thrifty i variable. Let X be the thrifty i variable of /. For 
each a ^ v{ there is an input I a identical to I except X Ia = a. Define the function F? : [k] — > [k] 

by 

{identity if i — 1 

//(*/, ifi = 2j 

fj(vlj,Fj) ifi = 2j + l 

Since F/ is a permutation, the root values of the inputs I a are all different from each other and from 
v 1 . If v[ = 1 then let J be any of the I a , and otherwise let J be the unique I a such that v[ a = 1. 
Then J E BT h [k) iff / ^ BT h [k). But their computation paths are the same, a contradiction. □ 

Lemma 2. Every input in Eq is thrifty (queries only its thrifty variables). 
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Proof. Because of the depth restriction, if an input queries each of its thrifty variables, then it is 
thrifty. So this lemma follows from LemmadJ □ 



Lemma 3. Every input in E\ is thrifty. 

Proof. Suppose there is some / in E 1 that is not thrifty. For each node j, let X, be the unique 
j variable that / queries. Since / is not thrifty, there is an internal node i such that Xj is not the 
thrifty fi variable of /. Let i* be such a node of minimum height. Since the computation path of / 
constrains only one value of each internal node function, we can choose an input J E E such that 
Xj = Xj for all nodes j. J is thrifty by Lemma [2] In particular, Xj* is the thrifty f* variable of 
J. Since Xj* is not the thrifty fi* variable of /, it must be that v 2i * ^ v 2i * or v 2i , +1 7^ v ii*+i- Wlog 
assume it is the first case. By our choice of i* and the assumption that / queries every node, we 
know / queries all its thrifty T 2i variables. Since the computation paths of / and J are identical, 
and J is thrifty, we have that / and J have the same thrifty T 2i variables. But then the only way 
to have v 2i * 7^ v 2i * is if there is a T 2i variable X that is thrifty for I (and so also for J) such that 
X 1 7^ X J . This contradicts the definition of J. □ 

Let E 2 := E — E 1 . Fix / in E 2 . Let P 1 be the maximum length initial segment of the 
computation path of / such that there is some J in E for which P 1 is also an initial segment of 
the computation path of J. Fix such a J. Since / is not in Ei, there must be some i such that I 
does not query any i variable. So by LemmaQ], we know P 1 cannot be the entire computation path 
of / (because then it would be the entire computation path of J). So the last state q t of P 1 cannot 
be its output state. Let q t+ i be the next state that / visits and e t the edge / takes from q t to qt+i- 
Let X t be the variable queried by q t and i t := var(X t ). There must be at least one J in E that 
follows P 1 (note the definition allows P 1 to be a single state). Let q' t+l be the next state visited by 
J . Since J disagrees with / on X t , it must be that q t is the first state on the computation path of 
/ that queries X t . On the other hand, there must have been a state q s before q t on P l that queries 
an i t variable X s distinct from X t ; otherwise, there would be a J' in E such that P 1 , e t , q t+ \ is 
an initial segment of the computation path of J', contradicting the maximality of P 1 . So now we 
know that J queries two distinct i t variables. But J is in E\ (since E C Ei), so this contradicts 
Lemma [21 □ 



3.4 Lower Bound for Thrifty BPs 

Now we give a tight lower bound for deterministic thrifty BPs. As discussed in section [27TI this 



improves on an earlier result in [BCM + 09a|, which gives a lower bound of \k h for all h and all 

k > (LY 

Theorem 4. For any h, k, every deterministic thrifty branching program solving BT h (k) has at 
least k h states. 

Fix a deterministic thrifty BP B that solves BT h {k). Let E be the inputs to B. Let Vars be the 
set of fc-valued input variables (so \E\ = A;' Vars l). Let Q be the states of B. If i is an internal node 
then the i variables are /j(a, b) for a, b G [k], and if i is a leaf node then there is just one i variable 
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li. We sometimes say variable" just as an in-line reminder that i is an internal node. Let var(g) 
be the input variable that q queries. Let node be the function that maps each variable X to the node 
i such that X is an i variable, and each state q to node(var(g)). When it is clear from the context 
that q is on the computation path of I, we just say "q queries i" instead of "q queries the thrifty % 
variable of 7". 

Fix an input I, and let P be its computation path. We will choose n states on P as critical states 
for I, one for each node. Note that I must visit a state that queries the root (i.e. queries the thrifty 
root variable of I), since otherwise the branching program would make a mistake on an input J 
that is identical to I except //KX) ■= k - ffiv^vi); hence J e BT^(k) iff I £ BT^(k). 
So, we can choose the root critical state for I to be the last state on P that queries the root. The 
remainder of the definition relies on the following small lemma: 

Lemma 4. For any J and internal node i, if J visits a state q that queries i, then for each child j 
ofi, there is an earlier state on the computation path of J that queries j. 

Proof Suppose otherwise, and wlog assume the previous statement is false for j = 2%. For every 
a ^ there is an input J a that is identical to J except = a. But the computation paths of J a 
and J are identical up to q, so J a queries a variable f(a, b) such that b = v J £ +1 and a ^ t) 2 J °, which 
contradicts the thrifty assumption. □ 

Now we can complete the definition of the critical states of /. For % an internal node, if q is the 
node % critical state for I then the node 2% (resp. 2% + 1) critical state for I is the last state on P 
before q that queries 2i (resp. 2i + 1). 

Now we assign a pebbling sequence to each state on P, such that the set of pebbled nodes 
in each configuration is a minimal cut of the tree or a subset of some minimal cut (and once it 
becomes a minimal cut, it remains so), and any two adjacent configurations are either identical, 
or else the later one follows from the earlier one by a valid pebbling move. This assignment can 
be described inductively by starting with the last state on P and working backwards. Note that 
implicitly we will be using the following fact: 

Fact 1. For any input I, if j is a descendant ofi then the node j critical state for I occurs earlier 
on the computation path of I than the node i critical state for I. 

The pebbling configuration for the output state has just a black pebble on the root. Assume we 
have defined the pebbling configurations for q and every state following q on P, and let q 1 be the 
state before q on P. If q' is not critical, then we make its pebbling configuration be the same as that 
of q. If q' is critical then it must query a node % that is pebbled in q. The pebbling configuration for 
q' is obtained from the configuration for q by removing the pebble from % and adding pebbles to 2% 
and 2% + 1 (if % is an internal node - otherwise you only remove the pebble from %). 

In the above definition of the pebbling configurations, consider the first critical state we define 
that queries a height 2 node (working backwards - so the first critical state we define queries the 
root). We use r 1 to denote this state and call it the supercritical state of /. Since the pebbling 
configurations up to r 1 (again, working backwards) are minimal cuts of the tree, and the children 
of node(r / ) are included, it is not hard to see that there must be at least h pebbled nodes. We refer 
to these nodes as the bottleneck nodes of /. Define the bottleneck path of r e R to be the path 
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from node(r) to the root. The bottleneck path of I E E is the bottleneck path of r 7 . This is the 
main property of the pebbling sequences that we need: 

Fact 2. For any input I, if non-root node i with parent j is pebbled at a state q on P 1 , then the 
node j critical state q' of I occurs later on P 1 , and there is no state ( critical or otherwise) between 
q and q' on P 1 that queries i. 

Let R be the states that are supercritical for at least one input. Let E r be the inputs with 
supercritical state r. Now we can state the main lemma. 

Lemma 5. For every r E R, there is an injective function from E r to [/c]' Vars ' _/l . 

The lemma gives us that \E r \ < fcl Vars l- ft for every r E R. Since {E r } r&R is a partition of 
E, there must be at least \E\/k^ 3rs ^ h = k h sets in the partition, i.e. there must be at least k h 
supercritical states. So the theorem follows from the lemma. 

Fix r £ R and let D := E r . Let i sc := node(r). Since r is thrifty for every / in D, there are 
values v 2isc and v 2i +1 such that v 2isc = v 2isc and v 2i +1 = v 2i +1 for every I in D. We are going 
to define a procedure Inter Ad v that takes as input a [k]- string (the advice), tries to interpret it as 
the code of an input in D, and when successful outputs that input. We want to show that for every 
I E D we can choose adv 7 E [fc]l Vars l -ft such that InterAdv (adv 7 )| = /. Of course, choosing 
adv 7 for each I yields the injective function required to prove the lemma. 

During the execution of InterAdv we maintain a current state q, a partial function v* from 
nodes to [k], and a set of nodes U\_. Once we have added a node to U\_, we never remove it, and 
once we have added v*(i) := a to the definition of v*, we never change v*(i). We have reached q 
by following a consistent partial computation path starting from r, meaning there is at least one 
input in D that visits exactly the states and edges that we visited between r and q. So initially 
q — r. Intuitively, v*(i)X = a for some a when we have "committed" to interpreting the advice 
we have read so-far as being the initial segment of some complete advice string adv 7 for an input / 
with v\ = a. Initially v* is undefined everywhere. As the procedure goes on, we may often have to 
use an element of the advice in order to set a value of v*; however, by exploiting the properties of 
the critical state sequences, for each I e D, when given the complete advice adv 7 for I there will 
be at least h nodes U[ that we "learn" without directly using the advice. Such an oppurtunity arises 
when we visit a state that queries some variable fi(bi, b 2 ) and we have not yet committed to a value 
for at least one of v*(2i) or v*(2i + 1) (if both then, we learn two nodes). When this happens, we 
add that child or children of % to (the L stands for "learned"). So initially U\_ is empty. There is 
a loop in the procedure InterAdv that iterates until \U\_\ = h. Note that the children of i sc will 
be learned immediately. Let v*(D) be the inputs in D consistent with v*, i.e. / € v*(D) iff I E D 
and v\ = v*(i) for every i E Dom(v*). 

Following is the complete pseudocode for InterAdv . We also state the most-important of the 
invariants that are maintained. 

Procedure InterAdv (a e [k]*): 
1: q := r, U\_ := 0, v* := undefined everywhere. 

2: Loop Invariant: If N elements of a have been used, then |Dom(t> *)| — N +\U L \. 
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3: while \U L \ < hdo 
4: i : = node(g) 

5: if i is an internal node and 2i ^ Dom(t>*) or 2i + 1 £ Dom(f *) then 

6: let bi, b 2 be such that var(g) = /i(6i, 62)- 

7: if 2i £ Dom(v*) then 

8: v*(2i) := 61 and U L := C/ L + 21 

9: end if 

10: if 2i + 1 £ Dom(V ) and \U L \ <h then 
11: v*(2i + 1) := b 2 and U L := Z7 L + (2i + 1). 

12: end if 
13: end if 

14: if i G" Dom(t>*) then 

15: let a be the next unused element of a. 

16: v*(i) := a. 

17: end if 

18: q := the state reached by taking the edge out of q labeled v*(i). 
19: end while 

20: let b be the next |Vars| — |Dom(t>*)| unused elements of a. 

21: let h, . . . , I\v*(d)\ be the inputs in v*(D) sorted according to some globally fixed order on E. 
22: if b is the t-largest string in the lexiocgraphical ordering of [A;]l Vars l _ l D ° m («*)l 5 and t < \v*(D)\, 
then return I t Q 

If the loop finishes, then there are at most \E\/\Dom(v*)\ = fcl Vars H Dom K)l i n p Uts i n v*(D). 
So for each of the inputs / enumerated on line [211 there is a way of setting a so that I will be 
chosen on linel22l 

Recall we are trying to show that for every I in D there is a string adv 7 G [A;]l Vars l^ /l such that 
InterAdv (a) I = I. This is easy to see under the assumption that there is such a string that makes 
the loop finish while maintaining the loop invariant; since the loop invariant ensures we have used 
|Dom(f*)| — h elements of advice when we reach line [201 an d since line 1201 is the last time when 
the advice is used, in all we use at most |Vars| — h elements of advice. To remove that assumption, 
first observe that for each /, we can set the advice to some adv 7 so that / G g(D) is maintained 
when InterAdv is run on a 1 . Moreover, for that adv 1 , we will never use an element of advice to 
set the value of a bottleneck node of /, and / has at least h bottleneck nodes. Note, however, that 
this does not necessarily imply that U[ (the h nodes U\_ we obtain when running InterAdv on 
adv 7 ) is a subset of the bottleneck nodes of /. Finally, note that we are of course implicitly using 
the fact that no advice elements are "wasted"; each is used to set a different node value. 

Corollary 1. For any h, k, every deterministic thrifty branching program solving BT h (k) has at 
least ^2 2 <i<h k l states. 

Proof. The previous theorem only counts states that query height 2 nodes. The same proof is easily 
adapted to show there are at least k h ~ l+2 states that query height / nodes, for I = 2, . . . , h. Those 

4 See after this code for argument that \v*(D)\ < fc|Vars|-|Dom(t,*)|^ 
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h — 1 state sets are disjoint, so we can sum the bounds. □ 

4 Main Results 

4.1 Fractional Pebbling Lower Bound 

The proof of Theorem [5] proceeds by reducing the problem of proving lower bounds on the frac- 
tional pebbling cost for balanced binary trees, to the problem of proving lower bounds on the 
black-white pebbling costs for a family of DAGs. In doing so, we are essentially discretizing the 
fractional pebbling problem; the main construction has a parameter c that determines how many 
nodes in the dag are used to "simulate" each node in the tree. We will use the next lemma (due to 
S. Cook) to conclude that we can always make c large enough that we don't "lose anything". 

Lemma 6. For every finite DAG there is an optimal fractional B/W pebbling in which all pebble 
values are rational numbers. (This result is robust independent of various definitions of pebbling; 
for example with or without sliding moves, and whether or not we require the root to end up 
pebbled.) 

Proof. Consider an optimal B/W fractional pebbling algorithm. Let the variables b Vjt and w V)t 
stand for the black and white pebble values of node v at step t of the algorithm. 

Claim: We can define a set of linear inequalities with 0-1 coefficients which suffice to ensure 
that the pebbling is legal. 

For example, all variables are non-negative, b v>t + w^t < L initially all variables are 0, and 
finally the nodes have the values that we want, node values remain the same on steps in which 
nothing is added or subtracted, and if the black value of a node is increased at a step then all its 
children must be 1 in the previous step, etc. 

Now let p be a new variable representing the maximum pebble value of the algorithm. We add 
an inequality for each step t that says the sum of all pebble values at step t is at most p. 

Any solution to the linear programming problem: 

Minimize p subject to all of the above inequalities 

gives an optimal pebbling algorithm for the graph. But Every LP program with rational coeffi- 
cients has a rational optimal solution (if it has any optimal solution). □ 

Now we are ready to prove the lower bound. We know this bound is not tight for heights at 
most 4. This is easy to see for height 2 (the bound should be d, but the theorem gives d/2 — 1), and 



proofs of the tight bounds for heights 3 and 4 are given in ||BCM + 09c I 



Theorem 5. The fractional pebbling cost for the degree d, height h tree is at least (d — l)h/2 — d/2. 

Proof. The high-level strategy for the proof is as follows. Given d and h, we transform the tree 
Tj into a DAG Gd,h such that a lower bound on #BWpebbles(G'd > / l ) gives a lower bound for 
#FRpebbles(T^). To analyze #BWpebbles(G ( i i / l ), we use a result of Klawe [Kla85|, who shows 
that for a DAG G that satisfies a certain "niceness" property, #BWpebbles(G') can be given in 
terms of #Bpebbles(G) (and the relationship is tight to within a constant less than one). The 
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black pebbling cost is typically easier to analyze. In our case, G^h does not satisfy the niceness 
property as-is, but just by removing some edges from G^h, we get a new DAG G' d h which is 
nice. We then show how to exactly compute #Bpebbles(G" d h ) which yields a lower bound on 
#BWpebbles(G , di/l ), and hence on #FRpebbles(Tj l ). 

We first motivate the construction G^h an d show that the whole black-white pebbling number 
of Gd,h is related to the fractional pebbling number of T£. 

We first use Lemma [6] to "discretize" the fractional pebble game. The following are the rules 
for the discretized game, where c is a parameter: 

• For any node v, decrease b(v) or increase w(v) by 1/c. 

• For any node v, including leaf nodes, if all the children of v have value 1, then increase b(v) 
or decrease w(v ) by 1/c. 

By Lemma [6l we can assume all pebble values are rational, and if we choose c large enough 
it is not a restriction that pebble values can only be changed by 1/c. Since sliding moves are not 
allowed, the pebbling cost for this game is at most one more than the cost of fractional pebbling 
with black sliding moves. 

Now we show how to construct G^n (f° r an example, see figure[3]). We will split up each node 
of into c nodes, so that the discretized game corresponds to the whole black-white pebble game 
on the new graph. Specifically, the cost of the whole black-white pebble game on the new graph 
will be exactly c times the cost of the discretized game on T%. 

In place of each node v of Gd,h has c nodes v[l], . . . , v[c\; having d of the v[i) pebbled 
simulates v having value d / c. In place of each edge (u, v) of is a copy of the complete bipartite 
graph (U, V), where U contains nodes u[l] . . . u[c] and V contains nodes v[l] . . .v[c). If u was a 
parent of v in the tree, then all the edges go from V to U in the corresponding complete bipartite 
graph. Finally, a new "root" is added at height h + 1 with edges from each of the c nodes at height 
/o So every node at height h — 1 and lower has c parents, and every internal node except for the 
root has dc children. 

To lower bound ^BWpebbles^^/J, we will use Klawe's result [Kla85J. Klawe showed that 
for "nice" graphs G, the black-white pebbling cost of G (with black and white sliding moves) is 
at least [#Bpebbles/2j + 1. Of course, the black-white pebbling cost without sliding moves is at 
least the cost with them. We define what it means for a graph to be nice in Klawe's sense. 

Definition 5. A DAG G is nice if the following conditions hold: 

1 . If ui, u 2 and u are nodes of G such that ui and u 2 are children of u (i.e., there are edges from 
ui and u 2 to u), then the cost of black pebbling u% is equal to the cost of black pebbling u 2 

2. If u\ and u 2 are children of u, then there is no path from ui to u 2 or from u 2 to u±. 

5 The reason for this is quite technical: Klawe's definition of pebbling is slightly different from ours in that it 
requires that the root remain pebbled. Adding a new root forces there to be a time when all c of the height h nodes, 
which represent the root of T^, are pebbled. Adding one more pebble to Gd,h changes the relationship between the 
cost of pebbling and the cost of pebbling Gd,h by a negligible amount. 
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Figure 3: Illustration to accompany the definition of G^h- This is £2,3 with parameter c = 3 




3. If u, tti, . . . , u m are nodes none of which has a path to another, then there are node-disjoint 
paths Pi, ... , P m such that Pi is a path from a leaf (a node with in-degree 0) to Ui and there 
is no path between u and any node in Pj. 

Gd,h is not nice in Klawe's sense. We will delete some edges from G^h to produce a nice graph 
G' dh and we will analyze #Bpebbles(G^ fc ). Note that a lower bound on #BWpebbles(G" d h ) is 
also a lower bound on #BWpebbles(G ? d i / l ). 

The following definition will help in explaining the construction of G' d h as well as for specify- 
ing and proving properties of certain paths. 

Definition 6. For u e Gd,h, let T d (u) be the node in T d such that T d {u) [i] = u for some i < c. For 
v, v' e Tj 1 , we say v < v' if f is visited before u' in an inorder traversal of T d . For u, u' E Gd,h, 
we say u < u' if T d {u) < T%(u') or if for some f G T£, u = v[i], u' = v[j), and i < j. 

G' d h is obtained from Gd,h by removing c — 1 edges from each internal node except the root, as 
follows (for an example, see figure H]). For each internal node v of T, consider the corresponding 
nodes v [1], v[2), . . . , d[c] of Gd,h- Remove the edges from v[i] to its i — 1 smallest and c — i largest 
children. So in the end each internal node except the root has c(d — 1) + 1 children. 
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We first analyze #Bpebbles(G^ h ) and then show that it is nice. We show that #Bpebbles(G" d h ) = 
c[(d — l)(h — 1) + 1]. Note that an upper bound of c[(d — l)(h— 1) + 1] is attained using a simple 
recursive algorithm similar to that used for the binary tree. 

For the lower bound, consider the earliest time t when all paths from a leaf to the root are 
blocked. Figure [5]is an example of the type of pebbling configuration that we are about to analyze. 
The last pebble placed must have been placed at a leaf, since otherwise t — 1 would be an earlier 
time when all paths from a leaf to the root are blocked. Let P be the newly-blocked path from a 
leaf to the root. Consider the set S = {u £ G' d h \ u £" P and u is a child of a node in P} of size 
c(d- l)(h - 1) + (c- 1) = c[(d-l)(h - 1) + 1] - 1 (the c- 1 is contributed by nodes at height 
h). We will give a set of mutually node-disjoint paths {P u } ue s such that P u is a path from a leaf 
to u and P u does not intersect P. At time t — 1, there must be at least one pebble on each P u , since 
otherwise there would still be an open path from a leaf to the root at time t. Also counting the leaf 
node that is pebbled at t gives c[(d-l)(h-l) + 1] pebbles. 

Definition 7. The left-most (right-most) path to u is the unique path ending at u determined by 
choosing the smallest (largest) child at every level. 

Definition 8. P(l) is the node of path P at height I, if it exists. 

For each u £ S at height I, if u is less than (greater than) P(l) then make P u the left-most 
(right-most) path to u. Now we need to show that the paths {P u } u es U {P} are disjoint. The 
following fact is clear from the definition of G' dh . 

Lemma 7. For any u,u'EG dh ,ifu<u' then the smallest child ofu is not a child ofu', and the 
largest child ofu' is not a child ofu. 

First we show that P u and P are disjoint. The following lemma will help now and in the proof 
that G' d h is nice. 

Lemma 8. For u,v £ G' dh with u < v, if there is no path from u to v or from v to u then the 
left-most path to u does not intersect any path to v from a leaf and the right-most path to v does 
not intersect any path to u from a leaf. 

Proof. Suppose otherwise and let P' u be the left-most path to u, and P' v a path to v that intersects 
P' u . Since there is no path between u and v, there is a height /, one greater than the height where 
the two paths first intersect, such that P^(l), P^{1) are defined and P^(l) < P^{1)- But then from 
Lemma [7] P^(l — 1) ^ P' v (l — 1), a contradiction. The proof for the second part of the lemma is 
similar. □ 

That P u and P are disjoint follows from using Lemma[8]on u and the sibling of u in P. 

Next we show that for distinct u, u' £ S, P u does not contain u' . Suppose it does. Assume P u 
is the left-most path to u (the other case is similar). Since u ^ u', there must be a height / such 
that P u (l) is defined and P u (l — 1) = u' . From the definition of S, we know P(l) is also a parent 
of u' . From the construction of P u , since we assumed P u is the left-most path to u, it must be that 
P u (l) < P(l)- But then Lemma[7]tells us that u' cannot be a child of P(l), a contradiction. 

The proof that P u and P u > do not intersect is by contradiction. Assuming that there are u,u' £ S 
such that P u and P u > intersect, there is a height /, one greater than the height where they first 
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Figure 5: A possible black pebbling bottleneck of G' 2 3 , with c = 3 

intersect, such that P u (l) ^ P u '(l)- Note that P u and P u < are both left-most paths or both right- 
most paths, since otherwise in order for them to intersect they would need to cross P. But then 
from Lemma [7]P U (Z — 1) =^ -P u '(/ — 1), a contradiction. 

See Figure [5] for an example of a bottleneck of the specified structure for G' d h corresponding 
to the height 3 binary tree, with c = 3: 

The last step is to prove that G' d h is nice. There are three properties specified in Definition 
[51 Property 2 is obviously satisfied. For property 1, the argument used to give the black pebbling 
lower bound of c[(d — l)(h — 1) + 1] can be used to give a black pebbling lower bound of c(d — 
1)(Z — 1) + 1 for any node at height / < h (the 1 is for the last node pebbled, and recall the root 
is at height h + 1), and that bound is tight. For property 3, choose Pi to be the left-most (right- 
most) path from Ui if is less than (greater than) u. Then use Lemma[8]on each pair of nodes in 
• • -,u m }. 

Since #Bpebbles(G^) = c[(d- l){h - 1) + 1], we have 

#BWpebbles(G di?t ) > #BWpebbles(G^) > c[(d - l)(h - 1) + l]/2 

and thus that the pebbling cost for the discretized game on T% is at least (d — i)(h — l)/2 + .5, 
which implies #FRpebbles(T d fc ) >{d-l){h- l)/2 - .5. □ 



4.2 Less-Thrifty Branching Programs 
4.2.1 Thrifty BPs with Wrong-Wrong Queries 

A variable fi(a, b) is wrong- wrong for input I iff a v 2i and b ^ i^i+i- The next theorem shows 
that querying wrong- wrong variables does not help. 

Theorem 6. For any h, k > 2, if B is a deterministic BP that solves BT h (k) such that each input 
only queries variables that are thrifty or wrong -wrong for it, then B has at least k h states. 
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Proof. We use the definitions and conventions introduced in the first paragraph of the proof of 
Theorem HI The proof of the following lemma is similar to that of Lemma |4l (page [TOl® 

Lemma 9. For any J and internal node i, there is at least one state q on the computation path of 
J that queries the thrifty i variable of J, and for every such q, for each child j of i, there is a state 
on the computation path of J before q that queries the thrifty j variable of J. 

Recall that for the thrifty lower bound, to each input we assigned one "critical state" for each 
node, and a pebbling configuration to each critical state, such that the n pebbling configurations 
made a valid pebbling sequence. This was so even if the thrifty branching program was constructed 
based on a pebbling sequence of length greater than n. Now we will not be selecting critical states, 
and we will assign pebbling sequences with length possibly greater than n. It may be helpful to 
note that this way of assigning pebbling sequences will have the following property: 

Remark Let S be a complete pebbling sequence for T h such that the root is pebbled only once, 
and a pebble is removed from a non-root node i only during a move that places a pebble on the 
parent of i. For any k, if Bs,k is the thrifty deterministic BP for solving FT h (k) that implements 
S in the natural wayS, then for every input / to Bs,k, we will assign pebbling sequence S to /. 

In the end, this will result in a cleaner proof; in particular, we will be able to say that when we 
interpret the advice for /, every node that gets "learned" is a bottleneck node of / (see Fact©. 

We define the pebbling sequence for I E E by following the computation path of / from be- 
ginning to end, associating the t-th thrifty state q t visited by / with the t-th pebbling configuration 
C t , such that C t +\ is either identical to C t or follows from C t by applying a valid pebbling move. 
There is also a last pebbling configuration that is not associated with any state. Let q±, . . . , q t * 
be the thrifty states on the computation path of /, up to the first state q t * that queries the thrifty 
root variable of /. Note that qi must query a leaf by Lemma [9] We associate q± with the empty 
configuration C\. 

Assume we have defined the configurations C\, . . . , C t associated with the first t < t* thrifty 
states, and assume Ci, . . . , Ct is a valid sequence of configurations (where adjacent identical con- 
figurations are allowed), but neither it nor any prefix of it is a complete pebbling sequence. We 
also maintain that for all if < t, if node(g t /) is internal, then its children are pebbled in C t > and it is 
not. Let % := node(q t ). By the I.H. i is not pebbled in C t . We define C t+ i by saying how to obtain 
it by modifying C t : 

1. If i is the root, then clearly t = t*, and by the I.H. nodes 2 and 3 are pebbled. Put a pebble 
on the root and remove the pebbles from nodes 2 and 3. This completes the definition of the 
pebbling sequence for /. 

2. If i is a non-root internal node, then by the I.H. both children of i are pebbled. For each child 
j of i: if there is a state q' after q t that queries the thrifty i variable of /, and no state between 

6 Also this lemma is proved in a more-general context on page [25] 

7 We are talking about a particular family of thrifty BPs {Bs.k}, without taking the time to give a precise definition. 
Bs.k has |5| non-output layers (where |5| is the number of moves in S), and if a pebble is placed on i in the Z-th 
move of S when there are p pebbles on the tree, then there are k p states in layer I of B$,k, all of which query a node i 
variable. 
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q t and q' that queries the thrifty j variable of /, then leave the pebble on j, and otherwise 
remove it. 

3. If i is not the root, then place a pebble on i iff there is a state q' after q t that queries the thrifty 
par(?) variable of / and there is no state between q t and q' that queries the thrifty i variable 
of/. 

Now we use the classic argument that h pebbles are required to black pebble T h . The children 
of the root are pebbled in C t *, so C t * trivially has the property that there is at least one node 
blocking every path from the root to a leaf. So consider the first t < t* such that C t has that 
property. Then % := node(g t _i) must be a leaf; otherwise there would be an earlier configuration 
with the aforementioned property. Consider the first t' > t such that q t i queries the thrifty par(i) 
variable of /; such a state must exist by the definition of the pebbling sequence for /. Then we make 
r 1 := q t > be the supercritical state of /. We refer to the nodes pebbled in C t > as the bottleneck 
nodes of /. Let R be the states that are supercritical for at least one input, and for each r G R let 
E r be the inputs with supercritical state r. For r G i? we write i r sc for node(r), and for I G E r we 
refer to i r sc as the Supercritical node for /. 

The definition of the bottleneck path BnPath r for r G R has not changed: it is the path from 
i r sc to the root. We mentioned earlier that every node we "learn" for an input I is a bottleneck node 
of /. This is due to the next fact. For any I and q on the computation path of /, let Path/(g) be the 
part of the computation path of I starting with q. 

Fact 3. i is a bottleneck node of I G E r iff it is not in BnPath r and there is a state q G Path 1 (r) 
that queries the thrifty par(i) variable ofi and no state before q in Path 1 (r) that queries the thrifty 
i variable of I. 

It will be convenient to have named the following four sets of nodes: 

Definition 9 (SiblBnPath r , RightPath^, Learnable,., Learnable*). 

• SiblBn Path r is the set of nodes that are the sibling of a node in BnPath r . 

• For % G SiblBnPath r , RightPathj is the path from % to the right-most leaf under % (when the 
tree is drawn in the canonical way). 

• Learnable r is the set of nodes {2i r 5C , 2i r 5C + 1} + UieSit>iBnPath r RightPath i5 i.e. the nodes not 
on the bottleneck path that are the descendent of a node on the bottleneck path. 

• Learnable* := Learnable r — {2i r sc ,2i r sc + 1}. 

It is not hard to see that every I G E r has at least one bottleneck node in Right Path^ for each of 
the h — 2 nodes j G SiblBnPath r . Additionally both children of i r sc are always bottleneck nodes of 
/, so / has at least h bottleneck nodes. 

Let G be the set of partial functions from Vars to [k]. At least when k = 2 these are commonly 
called restrictions (of BT h (k)), so we will refer to them as restrictions. For g G G and DC£we 
write g(D) for the inputs in D consistent with g - i.e. g(D) :— {I G D \ MX G Dom(g). X 1 = 
g(X) } . It will be convenient to further partition the sets E r by fixing some of the variables initially. 
This finer partitioning appears in the statement of the main lemma: 
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Lemma 10 (Main Lemma). For some integer M, for every supercritical state r e R, there is a set 
of restrictions G\ nit of size at most /cl Vars l- M such that {gm\t(E r )}g mit eGr n . z * a partition of E r and 
for every g init in G[ nit , there is an injective function from g\„\ t (E r ) to [k] M ~ h . 

Let us see why the theorem follows from the lemma. Since {g\n\t(E r )} ginit eGr n] is a partition of 
E r , and G[ nit has size at most fcl Vars l- M ; there must be some g* nit G G\ nA such that g* nit (E r ) has size 
at least |£' r |//c' Vars '~ M . On the other hand, from the lemma we get that every set g mit (E r ) in the 
partition has size at most k M ~ h . Hence 

\E r \/k^' M <\g* nit (E r )\<k M - h 

Rearranging gives \E r \ < k^ ars ^/k h = \E\/k h , and this holds for all r e R. Since {E r } reR is a 
partition of E, we get that R must have size at least k h . 

Proof of Main Lemma 

We use T to refer to the height h balanced binary tree, or to the set of its nodes. We use Tj to refer 
to the subtree of T rooted at node i, or to its nodes. For U a set of nodes, Vars(£7) is the set of 
input variables corresponding to the nodes in U - i.e Vars(C/) := {X e Vars | X = U or X = 
fi(a, b) for some i £ U and a, b e [k]}. For D C E there is a partial function % h-> from T to 
[&] such that wfj, = a iff v\ = a for every I m D. Similarly there is a partial function X \-t vf 
from Vars to [k] such that X D \. = a iff X 1 = a for every / in D. 

The constant M mentioned in the theorem is k(h — l)(h — 2)/2 + k 2 (h — 1) + h, but we are 
just writing that expression here for clarity; we will not be reasoning about it. For each r e R, we 
are going to define a set G[ nit of at most £;l Vars l- M restrictions where each g m - lt e Cf nit is defined on 
some set of |Vars| — M variables. Before giving the precise definition of the partition, let us see 
where the expression for M comes from. For (h — l)(h — 2)/2 = (h — 2) + (h — 3) + ... + 1 
internal nodes i we will fix all but k of the k 2 variables that define the corresponding function /j. 
For each of the h — 1 nodes on the bottleneck path BnPath r , we will not fix any of the k 2 variables 
that define the corresponding function. Lastly, there will be h unfixed leaf variables. 

Let Uf lxed be all the nodes except Learnable r + BnPath r . In the following drawing, which 
depicts the construction for the height 5 tree when i r sc = 15 is the right- most height 2 node, the 
pruned nodes (the nodes in the subtrees that would be at the ends of the dashed lines) are [/£ xed and 
the unmarked nodes plus the A-marked nodes are Learnable r . The D-marked nodes are BnPath r 
and will have no fixed variables. The A-marked nodes are SiblBnPath r and will have k 2 — k fixed 
variables. 
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Let G r be all the restrictions g with domain Vars(L^ xed ). For every g G G r , for every internal 
node % in Learnable r , we have that fff^ is defined since g is defined for every T 2i variable. For 
each g E G r let G r g be the set of extensions g' of g such that for all internal nodes i in Learnable r , 
for all a 7^ and all b, g' is defined on /j(a, b), and g'(E r ) is not empty. Finally, we take Grj" nit 



to be {J gedr G r , g . The size of G[ nit is at most fcl Vars l- M . 

Now fix r E R and g; n \ t G Gf nh and let D := g m -, t (E r ). From this point on, we drop "r" 
from Learnable r , Learnable*, SiblBnPath r , BnPath r , and i r sc . Since r is thrifty for every I in D, we 
have v% isc l and f^ sc+ i4 (note r queries the variable fi sc ( v 2i sc ^ v 2i sc +i))- Since we have now fixed 
D = g- m \ t (E r ), when g is an extension of g mit we just write vf and X 9 instead of if and X 9 ^ D ^. 

As in the proof of TheoremSl we will define a procedure called InterAdv (short for "Interpret 
Advice") that takes advice in the form of a [fc] -string and interprets it as the code of an input in D. 
Ultimately we want to show: 

Proposition 1. For every I 6 D, there is some restriction g that extends g m \x and some advice 
adv 7 of length at most M — h, such that InterAdv (adv 7 ) 4- = g and I e g(D) and |Dom(g) — 
Dom(g init )| > I adv 7 1 + h. 



The procedure InterAdv is given precisely in pseudocode on page [23] and relies on the sub- 
procedures given on page [23] and the following simple definition, which depends on the fixed input 
set D: 

Definition 10 (g constrains vi). We say g constrains v j if for some / 6 9(D), the thrifty i variable 
of / is in Dom(g) 

Recall how in the proof of Theorem [H while reading the advice adv 7 for / G D, we maintain 
a current state q G Path 7 (r) and build up a set of "learned nodes" which we called U\_. We are 
still building up a set of learned nodes, though in the pseudocode we have opted not to introduce 
a variable for that set explicitly. The learned nodes are just those nodes j such that at some point 
during the execution of InterAdv (adv 7 ), the subprocedure LearnNode is called with second 
argument j. In the thrifty proof, to characterize how we are interpreting the prefix of the advice 
that we have read so-far, we only need to record at most one value per node because every input 
is limited to querying its n thrifty variables (in the pseudocode we used the variable v*, a partial 
mapping from T to [k]). More precisely, we had that if v*(i)l = a after reading some advice 
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elements b, then v\ = a for every input / in E r whose complete advice adv has b as a prefix, i.e. 
for every input in v*(E r ). Now that inputs can query non-thrifty variables, instead of v* we will be 
building up a restriction g, where initially g = g mit . However, the meaning of g(X)\. = a is what 
one would expect by analogy with v*\ if g(X)\. = a after reading some advice elements b, then 
X 1 = a for every input I in D whose complete advice adv 7 has b as a prefix, i.e. for every input in 
g(D). As with v* before, once we define the value g takes on a given variable, we never change it. 

We first learn the children of i sc at r; we treat this as a special case now because it is the only 
time when we learn two nodes while examining one state. After that we learn a node in essentially 
the same situation as before: we reach a state q after reading some of the advice such that: 

1. q queries a variable fi(a 2 i, azi+i) mat is thrifty for every / 6 9(D), and^l 

2. For j = 2i or j = 2i + 1 (not both), g does not constrain Vj (j is the learned node). 

We need h — 2 such states after r for each input in D. Let us say q is a learning state for I E D 
if both those conditions hold or if q = r. In fact, by the properties of <7j n j t , and since after r we 
will only ever learn nodes in Learnable* = Uj £ sibiBnPath RightPath^, we can write the previous 
conditions in a more informative way: 

1. For some internal i E Learnable* + (BnPath — z sc ), q queries a variable fi(a 2i , a 2 j+i) that is 
thrifty for every I E g(D), and 

2. If i is in Learnable* then g does not constrain v 2 i+i. 

If i is in BnPath — i sc and j is the child of % in BnPath, then g does not constrain f s ibi(j)- 

We can be more specific still; later we will show that for each of the h — 2 nodes j E SiblBnPath, 
we will learn at least one node in RightPath^. 

Let us now explain what "learning a node" entails. Temporarily fix / E D. Suppose that while 
interpreting the advice for / we reach a state q E Path / (r) / that is a learning state for /. So q 
queries the variable fi(a 2 i, a 2 i+i) for some i in Learnable* + (BnPath — i sc ) and a 2i , a 2 i+i in [A;]. 
If i is in BnPath — i sc then let j be the child of i in BnPath, and otherwise let j be 2% + 1. We 
are learning node j. If j is an internal node, then first we use the advice, if necessary, to make g 
total on \Zars(T 2 j + T 2 j + \). After that, there is one variable X that is the thrifty j variable for every 
/ E g(D). So then we "learn" j by adding X i-> aj to g. The key point is that we have made 
progress since we used only m = |Dom(g) / Vars(T 2j + T 2 j +1 ) \ new elements of advice to define 
g on m + 1 new variables. 

The main thing we still need to show is that we can define adv 7 so that InterAdv (adv 7 ) will 
visit at least h — 2 learning states for / after r. As mentioned earlier, / has at least one bottleneck 
node in RightPath,- for each of the h — 2 nodes i E SiblBnPath. By Fact[3l for each of those 
bottleneck nodes j there is a state q\ in Path 7 (r) that queries the thrifty par(j) variable of /, and 
no state between r and gj that queries the thrifty j variable of /. 

For each i E SiblBnPath,., let q\ be the earliest state in Path 7 (r) among the states 

Wj I j £ RightPath^ and j is a bottleneck node of /} 
8 Here g is the current restriction. 
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and let ji be such that q\ = qj.. Then at least the nodes {j;}j e sibiBnPath will be learned, and 
specifically ji will be learned upon reaching qj.. To prove this, for par(jj) G Learnable* use Fact|3] 
together with the comments given in footnote \W\ on page[24l For par(jj) G BnPath — z sc , use the 
following fact (with j = sibl(jj) and f = par(jj)): 

Fact 4. For all I G D, if j is a non-root node in BnPath and j' is its ancestor in BnPath, then 
there are states in Path 1 (r) that query the thrifty j and j' variables of I, and the first such state 
for j occurs before the first such state for j'. 



Pseudocode for InterAdv and subprocedures 

The procedure Fill implements a very simple function: given inputs g, V (the advice string a and 
the current index into it are implicit arguments), it just uses the advice to define g on any variable in 
V on which it is not yet defined. We call Fill in two qualitatively distinct situations. One is when 
for some i, V is a single i variable X such that for every I G g(D),we have determined that either 
i is not a bottleneck node of / or X is not thrifty for /. That is the situation when we call Fill 
from InterAdv. The other situation occurs when LearnNode calls Fill on Vars(T 2 j U T 2j+ i) 
for some j that we have decided to learn. We do this because in order to learn j, we need g to be 
defined on enough input variables that the inputs in g(D) agree on the "name" of their thrifty j 
variable, i.e. we need v%jl and ff,- +1 4. 

Subprocedure Fill(# g G, V C Vars): 
1: let d\ , . . . , CL m be the next m = \V/Dom(g) \ elements of the advice string 
2: let X\, . . . , X m be V/Dom(g) sorted according to some globally fixed order on Vars 

3: add Xi (->• ai, . . . , X m !->• a m to g 

Subprocedure LearnNode^ g G, j G Learnable*, b e 
1: if j is not a leaf then 

FlLL(£,Vars(T 2j +T 2i+1 )) 

f.i ( r -ij ■''■{, ■ : j 



let X :=/,(<, vl 



else 

let X = lj 
end if 

add X i — y b to g 



Procedure InterAdv (a g [k]*): 

1: // Note the advice string a and the current index into it are implicit arguments in every call to 

Fill and LearnNode. 

2: q^r, g <- g ]nit 

3: while q is not an output state do 

4: i node(g), X ^— var(g) 

5: if X Dom(g) then 

6: if i = L c then 
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7: add l 2kc ^ v*£ and l 2isc+1 ^ v*£ +1 to g 

8: else if i G BnPath — ? sc then 

let j be the child of i in BnPath @ 
let a 2i , a 2 i+i be such that X = fi(a 2i , a 2i+1 ) 
if Vj.\, = a,j and g does not constrain f s ibi(j) then 

// Uses |Vars(descendants(sibl(j))) / Dom(g)| elements of advice: 
LearnNode (g, sibl(j), a s ibi(j)) 
else 

FlLL(g, {X}) // C/ses one element of advice. 
end if 
else // z e Learnable* 

if i is an internal node and g does not constraint v 2 i+i then 
let 6 be such that X = fi(v 9 2 f, b) 

// Uses |Vars(descendants(2z + 1)) / Dom(g)| elements of advice: 
LearnNode (g, 2i + 1, b) 
else 

FlLL(g, {X}) II Uses one element of advice. 
end if 
end if 
end if 

q the state reached by taking the edge out of q labeled g(X) 
end while 
return g 

□ 



4.2.2 Less-Thrifty BPs with Additional Queried Variables 

The previous result can be generalized to give gradually weaker lower bounds for gradually weaker 
restrictions on the model. For B a deterministic BP that solves BT h (k), for every state q of B that 
queries a variable f(a, b), let RightThrifty(g) be the set of integers a' (including a) such that there 
is some input to B that visits q and has values a' and b for nodes 2i and 2i + 1. Likewise, let 
LeftThrifty(g) be the set of integers b' such that there is some input that visits q and has values a 
and b' for nodes 2i and 2i + 1. Theorem [6] is the special case of the following result when it — 1. 

Theorem 7. For any h,k > 2 and n < k, if B is a deterministic BP that solves BT h (k) such that 
|LeftThrifty(g)| < ir and |RightThrifty(g)| < rr for every state q that queries an internal node, 
then B has at least k h /n h ~ 2 states. 

Proof. We modify the proof of Theorem [6] We first need to verify that the analogue of Lemma [9] 
for this context holds: 

9 This makes sense because every node in BnPath other than i sc has a child in BnPath. 

10 v 2ii by definition of g mlt since 2i is the left child of a node in Learnable. Also X = /i(ufi) b) for some b since 
<7i„it is not defined on X. Also wf i+1 i = b- since X is not wrong-wrong for any / G g{D), it must be thrifty for every 
I e g{D). 
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Lemma 11. For any I and internal node i, there is at least one state q on the computation path of 
I that queries the thrifty i variable of J, and for every such q, for each child j of i, there is a state 
on the computation path of I before q that queries the thrifty j variable of I. 

Proof. We use the strategy from the proof of Lemma H] on page OH I must visit at least one state 
that queries its thrifty root variable, since otherwise B would make a mistake on an input J that 
is identical to / except f({v^ v%) = k — f((vl, Now let q be a state on the computation path 
of I that queries the thrifty i variable of /, for some internal node i. Suppose the lemma does 
not hold for this q, and wlog assume there is no earlier state that queries the thrifty 2% variable 
of I. For every a ^ there is an input J a that is identical to J except = a. This implies 
|RightThrifty(g)| = k, contradicting the assumption that |RightThrifty(g)| < n < k. □ 

The assignment of pebbling sequences to inputs and the definition of supercritical states is the 
same. In fact nothing more needs to be changed until the statement of the Main Lemma, which is 
now: 

Lemma 12 (Main Lemma). For some integer M, for every supercritical state r G R, there is a set 
of restrictions G[ nit of size at most fel Vars l- M such that {g\n\t{E r )} g m ^GT . is a partition of E r and 
for every g\ n - lt in G[ nit , there is an injective function from g- in \ t (E r ) to [7t] h ~ 2 x [k] M ~ h . 

So in order to cope with the relaxed restrictions on the model, in addition to the [k] -valued 
advice string of length M — h we now have a [7r]-valued advice string of length h — 2. One can 
show the theorem follows from the lemma in the same way as in the proof of Theorem [61 Really at 
this point there is just one additional observation needed to adapt the proof of Theorem[6] Suppose 
we have a set of inputs F all of which have value a for v 2 j (i.e. v^i = a), and all the inputs in F 
visit a state q that queries a variable /j(a, b). Then we can use the elements of [tt] to code the values 
of V2i+i for inputs in F. More concretely, let oi, . . . , a m be the m < n integers LeftThrifty(g) in 
increasing order. Then to each / 6 F we assign the index of v^ i+1 in a±, . . . , a m . Of course a 
similar property holds for the case when F is a set of inputs that agree on i>2i+i- We use this 
observation later to show that if we "know" the value of node 2i upon reaching q, then we can 
learn node 2i + 1 with the help of just an element of 7r-valued advice, and similarly for learning 
node 2%. 

The definition of G[ nit is the same, and as before we fix r G R and g init G G] n]t and then define a 
procedure that interprets some given advice as the code of an input in D := g- m ; t (E r ). The analogue 
of Proposition [Tl (page |2TI) is: 

Proposition 2. For every I G D, there is some restriction g that extends g\ n \ t , a [n]-valued advice 
string adv^ of length h — 2 and a [k]-valued advice string adv^ of length at most M — h, such that 
InterAdv (adv^, adv^)| = g and I G g(D) and |Dom(g) — Dom(g init )| > |adv 7 1 + h. 

However, it will be convenient to instead give a procedure InterAdv' for which the following 
superficially different proposition holds: 

Proposition 3. For every I G D, there is some restriction g that extends g- m \ t , a \n\-valued advice 
string adv^ of length at least h — 2 and a [k]-valued advice string adv^ of length at most M — 
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|adv^| - 2, such that lNTERADv'(adv£, adv^.)4, = g and I G g(D) and |Dom(g) - Dom(g in j t )| > 
|adv 7 | + |adv[| + 2. 

To get the procedure InterAdv of Proposition [2] from the procedure InterAdv' of Propo- 
sition |3l just run InterAdv' until h — 2 elements of the [7r]-valued advice have been used, and 
then, if necessary, use elements of the [A;] -valued advice whenever an additional element of the 
[7r]-valued advice is required. This works since tc < k and |adv^| < M — | adv^ | — 2. 

Let us say q is right-thrifty for / if q queries a variable fi(a, b) such that b = i>f i+1 and a ^ v^. 
Similarly define left-thrifty for /. Previously, while interpreting the advice for I we only learned 
node values at states that are thrifty for /. Now we may learn node values at states that are thrifty, 
right-thrifty, or left-thrifty for /. As before, we always learn the children of i sc , and the remaining 
h — 2 nodes we learn are in Learnable*. 

First we consider the case of learning a node in SiblBnPath. We consider the case of learning 
a left child 2i - the case of learning a right child is similar. Let q be the first state in Path / (r) that 
queries the thrifty 2i + 1 variable of /. If we learn 2i, then we do so at the first state q' after q 
that queries an i variable that is thrifty or right-thrifty for I. Now we consider the case of learning 
a node in Learnable* — SiblBnPath. Every node in Learnable* — SiblBnPath is a right child, so 
suppose we are learning 2i+l. Then we do so at the first state in Path / (r) that queries an i variable 
that is thrifty or left-thrifty for /. 

As before, for each / in D and each of the h — 2 nodes i in SiblBnPath, we will learn at least 
one node in RightPath^ (and of course we still learn the children of the supercritical node i sc ). This 
is again proved using Facts |31 (page [T9b and l4l (page l23l); both still hold since we did not change the 
assignment of pebbling sequences to inputs. 

We provide pseudocode for InterAdv', just in case the reader has questions not explic- 
itly addressed in the preceding prose. On the other hand, there is little to read since it differs 
from the previous definition of InterAdv (14.2.11 on page|23]> only in a few lines near the two 
calls to LearnNode (specifically lines [121 - [151 and |2T1 - 124|). The two subprocedures FILL and 
LearnNode do not use the [n] -valued advice and do not need to be modified. 

Procedure InterAdv '(a^ e [k]*,^ e [n]*): 

1 : // Note the advice string and the current index into it are both implicit arguments in every 

call to Fill and LearnNode. 

2: q^r, g <- g init 

3: while q is not an output state do 

4: % node(g), X ^— var(g) 

5: if X £ Dom(g) then 

6: if i = i sc then 

7: add l 2lsc ^ and l 2isc+ i ^ v$£ +1 to g 

8: else if i e BnPath — i sc then 

9: let j be the child of i in BnPath 

10: let a 2i , a 2i+ i be such that X = fi(a 2i , a 2i+ i) 

11: if Vjl = dj and g does not constrain f s ibi(j) then 

12: let z be the next element of the [7r]-valued advice. 
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13: if j = 2i + 1 then let 6 be the z-th greatest integer in RightThrifty(g) and otherwise 

let 6 be the 2-th greatest integer in LeftThrifty(g) 

14: // Uses |Vars(descendants(sibl(j))) / Dom(g)| elements of [k]-valued advice: 

15: LearnNode(^, sibl(j), b) 

16: else 

17: FlLL(g, {X}) II Uses one element of [k]-valued advice. 

18: end if 

19: else//z G Learnable* 

20: if i is an internal node and g does not constraint v 2 i+i then 

21: let z be the next element of the [it] -valued advice. 

22: let b be the 2-th greatest integer in LeftThrifty(g). 

23: // Uses |Vars(descendants(2z + 1)) / Dom(g)| elements of [k]-valued advice: 

24: LEARNNODE(g, 2? + 1,6) 

25: else 

26: FlLL(g, {X}) II Uses one element of [k]-valued advice. 

27: end if 

28: end if 
29: end if 

30: q <— the state reached by taking the edge out of q labeled g(X) 
31: end while 
32: return g 

□ 

We give one more extension of the thrifty lower bound. We introduce another parameter 
w: for each input /, we require that there are at most w nodes i such that / visits a state q 
with |RightThrifty(g)| > 1 or |LeftThrifty(g)| > 1. The motivation for this is that for w = 1 
and 7r = log k — log log k, the model includes BPs that achieve the best known upper bounds 
for BT h {k), namely 0(k h / log k). For those parameters the theorem gives a lower bound of 
k h j (log k — log log k) = VL(k h / log k). In [|BCM + 09bl it was shown that the minimum number of 
states for unrestricted deterministic BPs solving BT 3 (k) is Q(k 3 / log k). 

Theorem 8. For any h, k > 2 and tt < k and w < h — 2, if B is a deterministic BP that solves 
BT h (k) such that |LeftThrifty(g)| < n and |RightThrifty(g)| < it for every state q that queries an 
internal node, and such that for every input I there are at most w nodes i such that I visits a state 
q that queries an i-variable and has |RightThrifty(g)| > 1 or |LeftThrifty(g)| > 1, then B has at 
least k h /n w states. 

Proof. This is an easy modification of the proof of the previous result. Note that RightThrifty(g) 
and LeftThrifty(g) are properties of B, so independently of the advice we can label each internal 
node querying state q with the quanities |RightThrifty(g)| and |LeftThrifty(g)|. If for some advice 
q is a learning state, then we will use an element of the 7r-valued advice iff | RightThrifty(g) > 1| 
or |LeftThrifty(g) > 1|. Hence, for any input I in E r , we can define the advice for / so that for all 
but at most w of the h — 2 learning states q of I after r, we do not need to use an element of the 
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[7r]-valued advice to learn a child of node(g). So we only need a [7r]-valued advice string of length 
w. □ 



5 Open Problems 

The first is a problem that can, in principle, be resolved using a computer. 

1. Show that for some k, h there is a deterministic branching program with fewer than (k + l) h 
states that solves FT h {k). 

Theorem |4] suggests the following conjecture: for all h, nondeterministic thrifty branching pro- 
grams solving FT h (k) require fi(fc #FRpebbles(T ' 1) ) states. 

2. Refute it, with or without the thrifty restriction. 
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